rank | frequency | n-gram |
---|---|---|
1 | 12006 | -s |
2 | 11124 | -a |
3 | 5800 | -t |
4 | 3676 | -r |
5 | 3668 | -n |
rank | frequency | n-gram |
---|---|---|
1 | 4260 | -es |
2 | 2261 | -nt |
3 | 2109 | -at |
4 | 1951 | -ia |
5 | 1887 | -ar |
rank | frequency | n-gram |
---|---|---|
1 | 1327 | -ent |
2 | 1283 | -ció |
3 | 991 | -ada |
4 | 802 | -ats |
5 | 761 | -tat |
rank | frequency | n-gram |
---|---|---|
1 | 955 | -ment |
2 | 882 | -ació |
3 | 531 | -ades |
4 | 521 | -ions |
5 | 461 | -itat |
rank | frequency | n-gram |
---|---|---|
1 | 577 | -ament |
2 | 415 | -cions |
3 | 290 | -ar-se |
4 | 264 | -ència |
5 | 211 | -iques |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings